Feature engineering for MEDLINE citation categorization with MeSH
نویسندگان
چکیده
منابع مشابه
Multilabel associative classification categorization of MEDLINE articles into MeSH keywords.
The specific characteristic of classification of medical documents from the MEDLINE database is that each document is assigned to more than one category, which requires a system for multilabel classification. Another major challenge was to develop a scalable method capable of dealing with hundreds of thousand of documents. We proposed a novel system for automated classification of MEDLINE docum...
متن کاملA MEDLINE categorization algorithm
BACKGROUND Categorization is designed to enhance resource description by organizing content description so as to enable the reader to grasp quickly and easily what are the main topics discussed in it. The objective of this work is to propose a categorization algorithm to classify a set of scientific articles indexed with the MeSH thesaurus, and in particular those of the MEDLINE bibliographic d...
متن کاملAn Incremental Approach for MEDLINE MeSH Indexing
As an increasing number of new journal articles being added to the MEDLINE database each year, it becomes imperative to build effective systems that can automatically suggest Medical Subject Headings (MeSH) to reduce effort from human annotators. In this paper, we propose three approaches, one building upon another in an incremental way, to automatic MeSH term suggestion: 1) MetaMap-based label...
متن کاملClustering Citation Distributions for Semantic Categorization and Citation Prediction
In this paper we present i) an approach for clustering authors according to their citation distributions and ii) an ontology, the Bibliometric Data Ontology, for supporting the formal representation of such clusters. This method allows the formulation of queries which take in consideration the citation behaviour of an author and predicts with a good level of accuracy future citation behaviours....
متن کاملQueryCat: automatic categorization of MEDLINE queries
A searcher's inability to formulate an appropriate query can result in an overwhelming number of retrieved documents. Our approach to this problem is to use information about common types or categories of queries to (1) reformulate the user's initial query and (2) create an informative organization of the retrieved documents from the reformulated query. To achieve these goals, we first must ide...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: BMC Bioinformatics
سال: 2015
ISSN: 1471-2105
DOI: 10.1186/s12859-015-0539-7